Cross-Domain Topic Classification for Political Texts

نویسندگان

چکیده

Abstract We introduce and assess the use of supervised learning in cross-domain topic classification. In this approach, an algorithm learns to classify topics a labeled source corpus then extrapolates unlabeled target from another domain. The ability existing training data makes method significantly more efficient than within-domain learning. It also has three advantages over unsupervised models: can be specifically targeted research question resulting are easier validate interpret. demonstrate using case party platforms (source corpus) parliamentary speeches (target corpus). addition standard error metrics, we further performance by labeling subset target-corpus documents. find that classifier accurately assigns speeches, although accuracy varies substantially topic. propose tools diagnosing To illustrate usefulness method, present two studies on how electoral rules gender parliamentarians influence choice speech topics.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cross-Lingual Classification of Topics in Political Texts

In this paper, we propose an approach for cross-lingual topical coding of sentences from electoral manifestos of political parties in different languages. To this end, we exploit continuous semantic text representations and induce a joint multilingual semantic vector spaces to enable supervised learning using manually-coded sentences across different languages. Our experimental results show tha...

متن کامل

Multidimensional topic analysis in political texts

Automatic content analysis is more and more becoming an accepted research method in social science. In political science researchers are using party manifestos and transcripts of political speeches to analyze the positions of different actors. Existing approaches are limited to a single dimension, in particular, they cannot distinguish between the positions with respect to a specific topic. In ...

متن کامل

A link-bridged topic model for cross-domain document classification

0306-4573/$ see front matter 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.ipm.2013.05.002 ⇑ Corresponding author at: Department of Computer Science, South China University of Technology, Guangzhou, China. Tel.: +852 39438461; f 26035505. E-mail addresses: [email protected] (P. Yang), [email protected] (W. Gao), [email protected] (Q. Tan), [email protected] (K.-F. Wong)...

متن کامل

Topic Correlation Analysis for Cross-Domain Text Classification

Cross-domain text classification aims to automatically train a precise text classifier for a target domain by using labeled text data from a related source domain. To this end, the distribution gap between different domains has to be reduced. In previous works, a certain number of shared latent features (e.g., latent topics, principal components, etc.) are extracted to represent documents from ...

متن کامل

Transductive Distributional Correspondence Indexing for Cross-Domain Topic Classification

Obtaining high-quality annotated data for training a classifier for a new domain is often costly. Domain Adaptation (DA) aims at leveraging the annotated data available from a different but related source domain in order to deploy a classification model for the target domain of interest, thus alleviating the aforementioned costs. To that aim, the learning model is typically given access to a se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Political Analysis

سال: 2021

ISSN: ['1047-1987', '1476-4989']

DOI: https://doi.org/10.1017/pan.2021.37